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We report here the first characterization of a gene 
encoding a homogentisate dioxygenase, the Aspergillus 
nidulans hmgA gene. The HmgA protein catalyzes an 
essential step in phenylalanine catabolism, and disrup- 
tion of the gene results in accumulation of homogenti- 
sate in broths containing phenylalanine. hmgA puta- 
tively encodes a 448-residue polypeptide (A£ = 50,168) 
containing 21 histidine and 23 tyrosine residues. This 
polypeptide has been expressed in Escherichia coli as a 
fusion to glutathione 5-transferase, and the affinity-pu- 
rified protein has homogentisate dioxygenase activity. 

A. nidulans, an ascomycete amenable to classical and 
reverse genetic analysis, is a good metabolic model to 
study inborn errors in human Phe catabolism. One such 
disease, alkaptonuria, was the first human inborn error 
recognized (Garrod, A. E. (1902) Lancet 2, 1616-1620) 
and results from loss of homogentisate dioxygenase. 
Here we take advantage of the high degree of conserva- 
tion between the amino acid sequences of the fungal and 
higher eukaryote enzymes of this pathway to identify 
expressed sequence tags encoding human and plant ho- 
mologues of HmgA. This is a significant advance in char- 
acterizing the genetic defect(s) of alkaptonuria and il- 
lustrates the usefulness of our fungal model. 



The physiologically versatile filamentous ascomycete Asper- 
gillus nidulans is able to grow on Phe or PhAc 1 as the sole 
carbon source. The A. nidulans Phe catabolic pathway is nota- 
bly similar to its human counterpart (Fernandez-Cariun and 
Penalva, 1995). As in humans (Fig. 1), Phe is degraded to 
homogentisate (2,5-dihydroxy-PhAc). The aromatic ring is then 
cleaved by homogentisate dioxygenase to yield, after an isomer- 
ization step, fumarylacetoacetate, which is split by fumarylac- 
etoacetate hydrolase into fumarate and acetoacetate (see Fig. 
1). Aspergillus can also catabolize PhAc through homogentisate 
after two sequential hydroxylation reactions in the aromatic 
ring. 2 This PhAc pathway is absent in humans. 

Humans are very sensitive to defects in Phe catabolism. 
Loss-of-function mutations in structural genes of this pathway 
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cause different metabolic diseases. Alkaptonuria is one such 
disease, resulting from loss of homogentisate dioxygenase (EC 
1.13.11.15) (La Du et al. t 1958). This moderately disabling 
disease, whose main clinical features are darkening of the 
urine, pigmentation of cartilages, and arthritis in adults, was 
the first inborn error of metabolism to be described (Garrod. 
1902). However, the gene encoding homogentisate dioxygenase 
has not been characterized from humans or any other organism 
(see McKusick (1994)). Therefore, definitive evidence that the 
disease results from a loss-of-function mutation in this gene 
has not yet been obtained. Type I tyrosinaemia, resulting from 
fumarylacetoacetate hydrolase deficiency, is a different defect 
in human Phe catabolism with severe consequences. Our char- 
acterization of the faliA gene, encoding A. nidulans fumaryl- 
acetoacetate hydrolase, showed 47% identity at the amino acid 
level with its human homologue (Fern£ndez-Can6n and 
Penalva, 1995). Loss of this enzyme results in phenylalanine 
toxicity and extracellular accumulation of succinylacetone, a 
hallmark of the disease in the urine of human patients. The 
similarities in the overall organization of the Phe pathway, in 
the amino acid sequences for at least one enzyme, and in the 
consequences of equivalent genetic blocks between A. nidulans 
and humans prompted us to use this fungus as a model for 
certain metabolic aspects of human defects in Phe catabolism. 
Here we use this lower eukaryote to characterize, for the first 
time, a gene encoding a homogentisate dioxygenase and use its 
deduced amino acid sequence to identify its human and plant 
homologues. 

EXPERIMENTAL PROCEDURES 

Strains, Media, and Growth Conditions—A. nidulans biA 1 was used 
as wild type and source of cDNA. A biA I, inethGL argB2 strain was 
used as recipient for hmgA disruption, and a biA J, methGl strain was 
used as control in experiments involving the hmgA" at gB* disrupted 
strain. A. nidulans standard media (Cove, 1966) were used for strain 
maintenance, growth tests, and transformation experiments. Supple- 
ments were added when necessary. For transfer experiments, Aspergil- 
lus liquid cultures were grown at 37 °C with vigorous shaking in an 
appropriately supplemented defined medium containing (in g/liter) 
KP0 4 H 2 (13.6), (NH 4 ) 2 S0 4 (2.0), MgSO<-7H 2 0 (0.25). and FeS0,-7H 2 0 
(0.0005). with glucose at 0.3% (w/v) as carbon source. After 15-16 h of 
growth, mycelia were collected and transferred to fresh medium with no 
carbon source added. After a 1-h incubation, appropriate inducing or 
non-inducing carbon sources were added at the following concentra- 
tions: glucose at 1% (w/v); Phe. PhAc, all other PhAc hydroxy deriva- 
tives, and glutamate at 10 mM; Tyr at 5 mM, and potassium acetate at 
30 mM. These cultures were further incubated for the times indicated in 
each case. 

DNA Sequencing and Sequence Analysis — DNA was sequenced man- 
ually using the dideoxynucleotide chain termination method (Sanger et 
al. t 1977) with T7 DNA polymerase (Pharmacia Biotech Inc.) and 3S S- 
dATP (Amersham Corp.) on pBluescript SK* double-stranded tem- 
plates. Primers were either universal or a series of oligonucleotides 
specific for both strands of the hmgA locus. Amino acid sequence sim- 
ilarities were found using BLAST (Altschul et al. 1990) and the trans- 
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Fig. 1. The Phe/PhAc pathway in A nidulans. The enzymatic 
reactions involved in Phe catabolism are the same as in humans. 
Humans do not use the PhAc pathway. 

lation of nucleotide sequences in Gen Bank and EMBL data bases. 
Alignments were further improved by' eye. Only amino acid identities 
were considered. 

Transformation Techniques and Gene Disruption— A. nidulans was 
transformed following the method of Til burn et al (1983). hmgA was 
disrupted by transformation with an ar#S + -containing plasmid carry- 
ing a 1 .05-kilobase Clal-EcoRl fragment internal to the hmgA tran- 
scriptional unit. This fragment starts within the first intron splitting 
the coding region (at residue 23) and ends at residue 347. Thus, of the 
two incomplete genes generated after targeting this plasmid to hmgA, 
one lacks the 101 C- terminal codons while the second lacks the pro- 
moter region, the first exon, and part of the first intron. 

cDNA Cloning, Libraries, and Transcript Analysis— hmgA cDNA 
clones were isolated from a AgtlO cDNA library made with transcripts 
isolated from PhAc-induced cells. This library was screened with "plus" 
and "minus" cDNA probes as described (Fernandez- Canon and Penalva, 
1995). The "plus" probe was a subtracted cDNA probe obtained follow- 
ing Zimmermann et al. (1980) and Sargent (1987). Mycelia for RNA 
isolation were obtained after transfer to indicated carbon sources as 
described above and further incubation for 1 h at 37 °C. RNA isolation 
and Northern analysis were essentially as in Espeso and Penalva 
(1992). The hmgA probe for Northern analysis and isolation of genomic 
clones (from a AEMBL4 library) was a cDNA clone insert containing the 
complete hmgA ORF. 

Homogentisate Dioxygenase Assays— The enzyme was assayed spec- 
trophotometrically. following formation of maleylacetoacetate at 330 nm 
(Edwards and Knox, 1955). Mycelia pregrown in glucose were trans- 
ferred to 10 mM PhAc defined medium (strongly inducing conditions) 
and incubated for 1 h at 37 X. Lysis was achieved by sonication in 100 
mM potassium phosphate buffer, pH 7.0. Extracts were clarified by 
centrifugation (13,000 x g, 10 min, 4 °C) and used in enzyme assays, 
which contained, in a final volume of 1 ml, 100 mM potassium phosphate 
buffer, pH 7.0, 2 mM ascorbate, 50 jam FeS0 4 , 250 /iM homogentisate, 
and 50 jxl of extract (50-75 fig of protein). Recombinant 
GST::HmgA(9-448) enzyme expressed in Escheiichia coli was assayed 
by the same method. 

Construction of GST Fusion Proteins and Expression in E. coli— A 
cDNA insert containing the complete ORF (including the stop codon) 
was obtained from a cDNA clone as a Nod fragment. Ends were filled 
in with Klenow polymerase, and the fragment was cloned in the filled- 
in Aval site of pGEX-2T (Pharmacia). One of the two possible orienta- 
tions resulted in a plasmid driving expression in E. coli of a 
GST::HmgA(9-448) fusion protein (predicted molecular mass, 76 kDa). 
This plasmid was named pGEX::HMGA. The opposite orientation (plas- 
mid pGEX::AGMH) encodes an aberrant fusion protein terminating 
shortly after the end of the N- terminal GST moiety. A third pGEX 
derivative (pGEX::+2) was constructed in which the hmgA ORF was 
fused to GST in the correct translational orientation but in the incorrect 
reading frame, resulting in premature termination 13 residues after the 
end of GST. E. coli strains carrying each of the above pi asm ids were 
cultured at 28 °C in LB broth with ampicillin (100 jig-ml -1 ) until an 



A 6ao = 0.6 was reached. At this point 100 ^lm IPTG was added and 
cultures incubated for 3 additional hours at 28 °C. Cells were chilled in 
ice, concentrated by centrifugation and resuspension in 15 mM potas- 
sium phosphate buffer, pH 7.0, 150 mM NaCl, and lysed by sonication. 
Extracts were clarified by centrifugation (13,000 X g, 30 min, 4 °C) and 
either used directly in enzyme assays or purified (in the case of bacteria 
carrying pGEX::HMGA) by glutathione-Sepharose (Pharmacia) affinity 
chromatography, essentially as described by the manufacturer. Elution 
buffer was exchanged with 15 mM potassium phosphate, pH 7.0, 150 mM 
NaCl using a Pharmacia PD-10 column before using the purified prep- 
aration in enzyme assays. 

HPLC Analysis of Culture Filtra tes— Mycelia of the hmgA::argB+ 
and of the hmgA* (otherwise isogenic) control strain were pregrown in 
defined medium with glucose, transferred to flasks containing this 
medium with 25 mM Phe as sole carbon source, and incubated for 8 
additional hours at 37 °C. Culture supematants were filtered through 
0.45-jim pore size filters. 20-jd samples of 1:5 dilutions were analyzed 
with an HPLC apparatus equipped with a variable wavelength UV- 
visible detector and a Nucleosil 300-5 C-18 column (4 mm (inner 
diameter) x 250 mm), which was coupled to a 1 1 x 4 -mm precolumn of 
the same support. The mobile phase was 50 mM sodium phosphate 
buffer (pH 6.5)/methanol (80:20, v/v) at a flow rate of 0.5 ml-min" 1 . 

RESULTS 

Cloning and Molecular Characterization of the hmgA 
Gene—\Ne have used a subtraction procedure to isolate cDN As 
for A. nidulans genes whose transcription is induced by utili- 
zation of PhAc as sole carbon source (Fernandez-Canon and 
Penalva, 1995). A class of these cDNAs was present in the 
AgtlO library with a relative abundance of 2%. Nucleotide 
sequencing of overlapping cDNA clones and comparison with 
the corresponding genomic sequences revealed the presence of 
a long ORF interrupted by three short introns, 48, 54, and 49 
nucleotides, respectively (Fig. 2). This ORF can encode a 448- 
residue polypeptide, whose amino acid sequence showed no 
significant similarity to sequences with an assigned function 
deposited in SwissProt and PIR data bases or those obtained by 
conceptual translation of GenBank/EMBL data base entries 
(but see also below). 

Northern blot hybridization experiments (Fig. 3) showed 
that this gene encodes an ~1.7-kilobase message whose tran- 
scription pattern conforms to that expected for a gene of Phe/ 
PhAc catabolism. The gene was strongly induced by PhAc, Phe, 
and certain monohydroxy derivatives of PhAc and weakly in- 
duced by PhAc dihydroxy derivatives (on which the fungus 
grows very poorly). Transcription was undetectable under car- 
bon starvation conditions or in the presence of either glucose or 
either of two gluconeogenic carbon sources (acetate and gluta- 
mate). This pattern strongly suggests that expression of this 
gene is specifically induced by PhAc and Phe or by a common 
catabolite. Glucose only slightly reduces induction by PhAc. 
This transcription pattern is identical to that of the fahA gene 
(Fernandez-Canon and Penalva, 1995). Indeed, Southern blut 
hybridization experiments using DNA from purified phages 
from a AEMBL4 genomic library showed that this gene is 
closely linked to fahA, encoding A. nidulans fumarylacetoac- 
etate hydrolase. Nucleotide sequencing demonstrated that both 
genes were transcribed divergently, their corresponding ORFs 
being separated by an intergenic region of 415 base pair(s), 2 
presumably containing common elements controlling tran- 
scription of both genes. Clustering of genes belonging to the 
same metabolic pathway is not unusual in filamentous asco- 
mycetes. This, together with the above transcription data, 
strongly suggested that we had isolated a previously unchar- 
acterized gene of the Phe/PhAc catabolic pathway. The deduced 
molecular mass of its encoded polypeptide is 50,168 Da, which 
is very similar to that of 49 kDa estimated by SDS-polyaciyl 
amide gel electrophoresis for purified murine homogentisate 
dioxygenase (Schmidt et al, 1995). Thus, we suspected that 
this gene might encode a fungal homogentisate dioxygenase. 
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Fig. 2. Nucleotide sequence of the hmgA gene and amino acid sequence of its putative translation product. The nucleotide sequence 
shown was determined in both strands by sequencing a genomic and several overlapping cDNA clones covering the complete open reading frame. 
Coding and non- coding sequences are indicated in uppercase and lowercase letters, respectively. Introns, all flanked by consensus donor and 
acceptor sequences, were determined by comparison of genomic and cDNA sequences. Putative lariat boxes, as determined by comparison to the 
consensus sequence described by Parker et al. (1987) in. yeast, are underlined. Arrows indicate restriction enzyme cuts used to construct the 
disruption plasmid (see below) . 

analysis. Two independent transform ants showing the ex- 
pected hybridization pattern were phenotypically identical, 
and only one was studied further. This strain, carrying the 
hmgAwargB* disruption, grew no rm ally on lactose or on mix- 
tures of lactose and Phe or PhAc but was unable to grow on 
either Phe or PhAc as sole carbon source (Fig. 4B). In addition, 
hmgA disruption resulted in secretion of a red pigment on 
growth media containing Phe or PhAc (Fig. 4B). This pigment 
is visible even on Phe or PhAc plates, on which there was only 
residual mycelial growth of the mutant strain (due to utiliza- 
tion of other carbon sources present in the agar). The 
hmgAv.argB* gene disruption is recessive in diploids both for 
growth on Phe/PhAc plates and for secretion of the red pig- 
ment. Thus, hmgA integrity is required for Phe/PhAc catabo- 
lism, and its loss-of-function results in external accumulation 
of a red pigment. 

Disruption of hmgA Results in Secretion of Homogentisate 
due to Complete Absence of Homogentisate Dioxygenase— -Hu- 
man alkaptonuria (resulting from loss of homogentisate dioxy- 
genase) causes ochronosis of connective tissues due to the ac- 
cumulation of a dark pigment, which is an oxidation product of 
homogentisate. Similarly, the red pigment produced by the 
hmgAv.argB* strain turned dark brown after prolonged incu- 
bation times on culture plates or after a much shorter period in 
shaken (more aerated) liquid cultures. This suggested that this 
strain might secrete homogentisate. HPLC analysis of filtrates 
of cultures pregrown in glucose and transferred to Phe minimal 
medium (Fig. 5 A) confirmed that, in contrast to the hmgA* 
strain, the hmgAwargB* strain secreted homogentisate. This 
was strongly indicative of a block in homogentisate dioxygen- 
ase. Indeed, while substantial homogentisate dioxygenase ac- 
tivity was induced in hmgA* mycelia upon transfer to Phe, no 





Fig. 3. Transcription of hmgA is induced by PhAc or Plie. 
Northern blot hybridization of total RNA Isolated from mycelia trans- 
ferred to carbon sources indicated (- carbon indicates no carbon source 
added; see "Experimental Procedures" for concentrations of carbon 
sources). The probe was a ^P-labeled cDNA containing the complete 
hmgA ORF. Loading in each lane was equal, as determined by hybrid- 
ization with an actin probe (not shown) . 



We describe below genetic and biochemical evidence demon- 
strating that this is indeed the case, the Aspergillus homogen- 
tisate dioxygenase gene being designated hmgA. 

Disruption of the hmgA Gene Prevents Growth on Phe and 
Results in Secretion of a Red Pigment — In order to determine 
the Ioss-of -function phe no type of hmgA, a plasmid containing a 
genomic DNA fragment internal to the ORF was targeted to the 
hmgA locus by homologous recombination (Fig. 4^4). The 5 '-end 
of this fragment is located within the first intron (thereby 
lacking the promoter, the initiation codon, and the first exon), 
and its 3'-end is at codon 347 (thus lacking the 101 C-terminal 
codons). Homokaryotic transformants in which integration 
took place by single-copy homologous recombination in the 
resident hmgA gene were selected and verified by Southern 
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Fie 4. Disruption of the hmgA gene 
and growth tests. Panel A, a plasmid 
containing a internal fragment of hmgA 
was used in a transformation experiment 
to disrupt the hmgA gene, as shown in the 
scheme (see "Experimental Procedures" 
for details). This integration event results 
in two incomplete copies of hmgA, one 
lacking the 101 C-terminal codons and 
the other lacking the promoter region, the 
first exon, and part of the first intron. Two 
different transformants having identical 
phenotypes had this integration event, as 
established by Southern blot analysis us- 
ing hmgA and argB probes (data not 
shown). Open boxes denote the argB + 
gene; lined boxes denote the hmgA coding 
region (with three introns indicated); a 
wavy line indicates A. nidulans coding se- 
quences; and a straight line indicates vec- 
tor DNA. Numbers above the hmgA cod- 
ing region Indicate codon position. Panel 
B, growth phenotypes of the disrupted 
strain. Conidia of the hmgA + and of the 
hmgA::argB + strains were inoculated on 
minimal medium plates with the indi- 
cated carbon sources (lactose at 0.05% (w/ 
v), PhAc at 10 mM, and Phe at 25 itim). 
Plates were incubated for 3 days at 37 °C 
before being photographed. 
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such activity was detectable in mycelial extracts from the mu- 
tant strain (Fig. 5Bj. We conclude that loss-of-function of hmgA 
results in absence of homogentisate dioxygenase and conse- 
quent accumulation of homogentisate. 

hmgA Encodes a Protein with Homogentisate Dioxygenase 
Activity— To establish that hmgA encodes a homogentisate di- 
oxygenase, we expressed its protein product tagged with GST 
(Smith and Johnson, 1988) in E. coll A Nod cDNA fragment 
filled in with Klenow enzyme was subcloned in the fi!led-in 
Aval site of pGEX-2T. Recombinant plasmids in either orien- 
tation were recovered. The correct orientation (plasmid 
pGEX::HMGA) resulted in an in-frame fusion between the N- 
terminal GST coding region and codons 9-449 of the hmgA 
coding region (including the stop codon). The resulting fusion 
protein (predicted molecular mass, 76 kDa) was designated 
GST::HmgA (9-448). In contrast, in the reverse orientation 



Lactose + Phe 

(plasmid pGEX::AGMH), the GST reading frame is truncated 
by several in-frame stop codons present in the non-coding 
strand of the hmgA gene. As a second control, a third plasmid 
was constructed in which the hmgA coding region was fused to 
GST in the correct orientation but in an incorrect reading 
frame (pGEX::+2). Expression of the fusion genes in bacterial 
strains respectively carrying each of these three plasmids was 
induced with IPTG and homogentisate dioxygenase assayed in 
protein extracts. No activity was detectable in extracts of 
strains containing either the pGEX::AGMH or pGEX:: + 2 plas- 
mids. In contrast, high levels of homogentisate dioxygenase 
were detected in the extract corresponding to the GST fusion in 
the correct orientation and reading frame (Fig. 6A). A 76-kDa 
band {i.e. the expected molecular mass for the GST:: HmgA 
(9-448) fusion protein) was induced by IPTG in bacteria car- 
rying pGEX::HMGA (Fig. 6/3) but was absent in bacteria car- 
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FlC. 5. Disruption of hmgA results in secretion of homogenti- 
sate and absence of homogentisate dioxygenase. Panel A, HPLC 
analysis of culture filtrates of the hmgA* and the hmgA::argB* strains 
after transfer to defined medium with 25 mM Phe as sole carbon source. 
The detector was set at the wavelengths indicated. Homogentisate has 
a characteristic absorption maximum at 290 nm. Standards were as 
follows: /, 3,4-dihydroxyPhAc; II, homogentisate; III, 2,5-dihydroxyben- 
zoate; IV, Phe; V, phenyl pyruvate. Panel B, homogentisate dioxygenase 
activity in mycelial extracts from the wild type and the hmgA..aigB + 
strains. 

rying either of the control plasmids (data not shown). To prove 
that induction of homogentisate dioxygenase resulted from ex- 
pression of the GST::HmgA (9-448) fusion protein, it was 
affinity-purified from crude extracts. This purification resulted 
in a nearly homogeneous preparation having 1 0 times higher 
specific activity than crude extracts (see Fig. 6, A and B). We 
conclude that hmgA encodes a homogentisate dioxygenase. 

We noticed a significant loss in the activity of the 
GST::HmgA (9-448) fusion protein after purification by gluta- 
thione -Sepharose chromatography. The specific activity of our 
purified fusion protein (Fig. 6/4) is 7.5 times higher than that 
reported for murine homogentisate dioxygenase (Schmidt et al, 
1995). These figures represent a minimal estimation for the 
specific activity of the Aspergillus enzyme, as our protein has 
been produced in a heterologous host and we cannot exclude a 
negative effect of the N-terminal GST moiety on enzyme activ- 
ity. Our purified GST::HmgA (9- 448) fusion protein showed no 
activity when the reaction was carried out in the absence of 
Fe 2+ ions, as described for purified murine homogentisate di- 
oxygenase (Schmidt et al, 1995). In contrast, the enzyme was 
fully active in the absence of ascorbate. 

Identification of Human and Plant Homologues of hmgA— A 
search of the conceptual translation products of GenBank and 
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Fig. 6. Expression in E. coli and purification of a 
GST::HmgA(9-448) fusion protein with homogentisate dioxygen- 
ase activity. Panel A, assays of homogentisate dioxygenase activity in 
crude and purified extracts from bacteria carrying the plasmids indi- 
cated. The scheme shows relevant features of the GST fusion genes. P** 
indicates the promoter of pGEX2T. Panel B, SDS-polyacrylamide elec- 
trophoresis of the GST::HmgA(9-448) protein fusion expressed in E. 
coli. Samples were as follows: lane 1, size standards (Bio-Rad); lane 2, 
purified fraction after elution with glutathione; lane j, cmde extract 
after passage through the glutathione- Sepharose column; lane 4, crude 
bacterial extract; lanes 5 and 6, cells lysed in sample buffer, 3 h after 
induction {lane 5) or without IPTG induction {lane 6). Note the -76-kDa 
band induced by IPTG, retained by the affinity column, and eluted with 
glutathione (indicated by an arrow). 

EMBL data bases with the HmgA amino acid sequence re- 
vealed no similarity to any polypeptide encoded by an entry 
with an assigned function. However, ESTs encoding deduced 
polypeptides with significant sequence identity to HmgA were 
identified (Fig. 7). Four of these are human ESTs, of which 
three were from either liver or mixed liver/spleen libraries. Two 
of them (apart from probable sequence ambiguities) appear to 
be identical, their deduced amino acid sequence showing 50% 
identity to a 93-residue region in the C -terminal half of A. 
nidulans HmgA. The deduced amino acid sequence of the third 
showed 42% identity to an 89-residue N-terminal region of 
HmgA (Fig. 7). A fourth EST, putatively encoding a polypeptide 
showing similarity to this N-terminal region, was from pancre- 
atic islets. This amino acid sequence differed from that of the 
liver clone in only three different positions. 
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A 

Human ESTs 

aa 219 LGPIGSNGLANARDFQAPVAAFDDEEGPTEYRLYSXFNNHLFS^ 310 HmgA 

LGPIG NGLA RDF FA D P Y < LF A QD PF VAWHGNY PYKY L ? DPSI TVLT S CONSERVED RESIDUES 

T83013 

(nt 3) LGPIGANXLAXPRDFLIPIAtfYEDROVPGGYTVIN.<YO£KLfc7v\KUUvK^ ) DPSI FTVLTAKS + 3 EST IlOSOy !>' 

fetal liver stple 

R00700 

(nt, 3) TrGPTGANfi KGNY7PYKYNLKK?( )DPSI LTAKS +1 EST 127783 5' 

LJOtPRDFLIPIAhTEDRQVPGGYTVINl^YQGKLFAAKQDVSPFNVVAW STV +3 fetal liver sple 



aa 17 GFDSYHESEALEGALPVGHNSTOKAPYGLYAEKl^GTAFTAPRHENKOTW^ 105 HmgA 

G E G LP G N PC PY LYAE LSG AFT PR NK W Y I LP H F T PNQ RW CONSERVED RESIDUES 

(ntl60lGNECSSEDPRCPGSLPEGO>'NPQVCPYNLYAEQLSGSAFTCPR +1 T5S939 

( )NK *2 EST 77725 5' 
aSWLY ( ) LPSVSHKPFGIHLT-^AMFTHNKGMEVDPDPNQXRW t3 Liver 

0 □ 0 T21323 

(nt 9 2 ) GNECSSE DPRWPGSLPEGO/NN PQVC PXXLY AEQI SGSR +2 EST hbc 2645 5' 

FTCPRXTNKRSWLYWILP +3 Hutuan pancreatic i«1*t« 



B 

PLANT ESTs 



aa 14 YQ/<GFDSYHESEAIEGALPVGHNSPQKAPYGLYA£XLSGTAFTAPRH£NKC>TW t>3 HmgA 

YQ GF SEAJ GALP NSP PYGLYAE SGT FT PR N W YR P HE Ff >P QL/W p D CONSERVED RESIDUES 

(nt44} YQSGFGWHFXSEAIAGALPLDONSPL^CPYGLYAEQISGTSFTSPRKXNQRSWLYRVKPSVTHEPF +2 R29958 

t >PTQL3WRPeDIPDS +3 A. thai i ana EST 163B13T7 



aa 34 8 MGLITGNYDAKTGGGFQPP^SASLKNIMSAHGPDMHAFEGASNADLXPTKIGDGSKAFMFESSLMV 419 HirgA 

MGLI GYAK GFPGASK M HGP E( )G MA MFES L V W L CONSERVED RESIDUES 

T2O540 

(nt 2911 MGLI^GAYXAKXDG FXPGGASXKSCMXPHGPXTTTYE { ) GTMAXMFESALI PRVCXWAL -1, A. thaliane EST 86F8T7 



aa 362 GFQPAGASLHNIMSAHGPDMHJ^FEGASMADLKPTKIGD^ 421 HngA 

GF V GAS H M HGF-D E( ) MACMFEO L W CONSPRVP.D RRSTnURS 

(ntl3) GFVPGGASXHSCMTPHGPDTKTYE ( JTMAFMFESCLIPRICLWAVES +1 Riclnus ccirmunla EST pors789 

Fie 7. Identification of human and plant homologues oftlie hmgA gene. Sequence comparison between HmgA and deduced polypeptides 
encoded by human and plant ESTs. A, human 5'-ESTs. Identical residues are shown below the indicated regions of HmgA. Deduced sequences 
encoded by ESTs were grouped in two pairs, each corresponding to a different region of HmgA. It should be noted that sequences codified by T55939 
and T27323 differ in three positions (residues in boldface (indicated by boxes): see also text). Conserved residues are not indicated for clarity. 
Residues marked as A' represent ambiguities in the nucleotide sequence of the EST. Numbers on the right and left ends of HmgA sequences denote 
the positions of the amino acid residues in the protein. Numbers {in parentheses) on the left side of each EST polypeptide refer to the nucleotide 
position of the first translated codon in its corresponding DNA sequence. In several cases, frameshifting allows alignments with significant 
sequence conservation, possibly reflecting sequencing errors in ESTs. Therefore, deduced sequences corresponding to different reading frames 
(Indicated on the right) of the same 5 ' EST are shown when necessary. Parentheses indicate regions of deduced sequence with no evident similarity 
in any of the three EST reading frames. Also indicated on the right are the GenBank accession numbers and the source of cDNA. The liver spleen 
library is a mixed library. B, plant 5 ' ESTs. Sequence comparisons were shown as in A, indicating the organism from which the libra ly was 
constructed. 



The remaining three ESTs were from plants, two being from 
Arabidopsis. Their deduced polypeptides showed 44 and 43% 
identity to non-overlapping HmgA regions close to the N- and 
C -terminal regions, respectively. The polypeptide encoded by 
the third plant EST (from Ricinus communis) has 36% identity 
to the above near C-terminal HmgA region. 

DISCUSSION 

We describe here the first characterization in any organism 
of a gene encoding an homogentisate dioxygenase, the A. nidu- 
lans hmgA gene. Homogentisate dioxygenase activity is 
strongly induced in mycelia by Phe or PhAc, and hmgA cDNA 
clones were easily isolated using a subtracted cDNA probe from 
a library enriched in cDNAs for PhAc-induced transcripts. 
hmgA is essential for growth on Phe (or PhAc) as sole carbon 
source. This supports the A. nidulans Phe (PhAc) catabolic 
pathway as shown in Fig. 1 . Disruption of the gene results in 
complete absence of homogentisate dioxygenase activity. 
Therefore, the disruption created here is almost certainly a null 
allele. Due to this defect, this strain, when supplied with Phe or 



PhAc, secretes homogentisate, which is readily oxidized to yield 
a reddish pigment, eventually turning dark brown. The com- 
plete absence of enzyme activity in the disruption strain to- 
gether with the absence of cross-hybridizing bands in genomic 
Southern blots strongly indicates that A. nidulans contains a 
single gene encoding homogentisate dioxygenase. 

Mammalian homogentisate di oxygenases contain weakly 
bound ferrous ions that are required for activity (see Schmidt et 
ai (1995) and references therein). A 70% reduction in activity 
was also observed 2 when crude Aspergillus extracts were as- 
sayed in the absence of Fe 24 " ions, and the activity of the 
purified GST:: HmgA (9-448) fusion protein showed an abso- 
lute requirement for these ions. The deduced HmgA polypep- 
tide contains 21 His and 23 Tyr residues. Some of these resi- 
dues might be involved in binding iron, as demonstrated for 
protocatechuate 3, 4 -dioxygenase (Ohlendorf et ai., 1988). 

The deduced HmgA sequence was used to identify human 
ESTs potentially encoding a homologue(s) of the fungal gene. 
The high similarity detected at the amino acid level establishes 
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that human genes corresponding to these ESTs are indeed 
hmgA homologues. The EST-encoded amino acid sequences are 
classified in two groups, each corresponding to a different re- 
gion of HmgA. Therefore, our results do not establish the 
existence of a single human gene encoding homogentisate 
dioxygenase. 

Three of these ESTs were isolated from liver or liver/spleen 
cDNA libraries. Alkaptonuria results from loss of homogenti- 
sate dioxygenase, as demonstrated in both liver and kidney 
extracts. Our preliminary identification of cDNAs for homogen- 
tisate dioxygenase, nearly a century after alkaptonuria was 
recognized by Garrod (1902) as an inborn error of metabolism, 
represents a significant advance in the characterization of the 
human gene and further illustrates the validity of our fungal 
metabolic model for disorders in human Phe metabolism (Fer- 
nandez-Candn and Penalva, 1995). Definitive evidence that 
alkaptonuria results from loss-of-function mutation(s) in the 
homogentisate dioxygenase gene will require mapping of this 
gene to chromosome 3q2 (the location for alkaptonuria (Pollak 
et al f 1993; Janocha et al. t 1994)) and identification of muta- 
tions in patients with the syndrome. Finally, availability of the 
human gene will allow examination of tissue-specific expres- 
sion. One of the ESTs identified here was isolated from pan- 
creatic islet cDNA. Homogentisate dioxygenase activity in this 
tissue has not been reported previously. 

We have previously reported a class of mutations {suAfah) 
suppressing the effects of a complete fumarylacetoacetate hy- 
drolase deficiency in A. niduJans (Fernandez-Canon and 
Penalva, 1995). The phenotype of these suppressor mutations 
is indistinguishable of that caused by hmgAv.argB* mutation 
(i.e. loss of homogentisate dioxygenase and secretion of ho- 
mogentisate), and they do not complement the disruption in 



diploids. These suAfah mutations are therefore hmgA~ alleles 
(Fernandez-Canon and Pefialva, 1995). This suppression led us 
to propose that alkaptonuria would prevent the lethal effects of 
human type I tyrosinaemia by blocking the pathway upstream 
of fumarylacetoacetate hydrolase. Identification of mammalian 
homologues of hmgA reported here will facilitate testing this 
hypothesis in animal models. 
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